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Two recent preprints [B. Altshuler, H. Krovi, and J. Roland, "Quantum adiabatic optimization 

fails for random instances of NP-complete problems", arXiv:0908.2782 and "Anderson localization 

casts clouds over adiabatic quantum optimization", ,arXiv:0912.0746 argue that random 4th order 

perturbative corrections to the energies of local minima of random instances of NP-complete problem 

lead to avoided crossings that cause the failure of quantum adiabatic algorithm (due to exponentially 

small gap) close to the end, for very small transverse field that scales as an inverse power of instance 

j-». , size A'^. The theoretical portion of this work does not to take into account the exponential degeneracy 

*vj ' of the ground and excited states at zero field. A corrected analysis shows that unlike those in the 

' middle of the spectrum, avoided crossings at the edge would require high [0(1)] transverse fields, at 

i^T*! which point the perturbation theory may become divergent due to quantum phase transition. This 

effect manifests itself only in large instances [exp(0.027V) 3> 1], which might be the reason it had 

not been observed in the authors' numerical work. While we dispute the proposed mechanism of 

failure of quantum adiabatic algorithm, we cannot draw any conclusions on its ultimate complexity. 






,_i Quantum adiabatic algorithm. The quantum adiabatic algorithm is a generic algorithm proposed for the solution 

,.£5 of a variety of combinatorial optimization and decision problems involving binary variables from the NP-complete 

Mh family. One considers a Hamiltonian involving N qubits Xi (or N spins) dependent on parameter A: 
-(— > 

§! H{X) = ^ E{x)\x){x\- X^ ^ \xi...Xk.-.XN){xi...Xk-..XN\, (1) 

^ ■ xe{0,l}" fe a;e{0,l}" 

1—^, where Xk = 1 — Xk represents qubit flip. Here E{x) is the cost function corresponding to bit assignment x = 
(xi, . . . ,xn)- a solution is verifiable in polynomial time (but finding one may take exponential time) hence the 

►^ , Hamiltonian is implementable using only polynomially large number of gadgets. The system is initially prepared 

I • in a state that is the symmetric superposition of all 2^ possible bit assignments — an exact ground state of the 

^__l [ second ("driver") term in ([T]), which corresponds to the uniform magnetic field A in the direction orthogonal to the 

f^ . quantization axis of computational basis |a;). The parameter A is changed in time from A(0) S> 1 initially to A(r) = 

C^ ' at the end of the algorithm. By adiabatic theorem, the system will remain in its ground state with high probability 

ly-s , provided that dX/dt <C A^(A), where A(A) = £^i(A) — Eo{X) is the energy gap between the ground state of H{X) and 

f^ ' its first excited state. At time t = T the system will be in a superposition state of configurations with the optimal 

^^ cost and one of the optimal solutions may be obtained by preforming a final measurement on the qubits. The running 

^~^ time of the algorithm (its complexity) is given by T ^ l/A^j^^, where Ami„ is the minimum value of the gap as a 

►>. \ function of A. The value A := A* for which the gap is minimal will be referred to as the bottleneck of the algorithm. 

• "^ ■ . 

k>( ^ It is known that the minimum gap can be exponentially small in N in the worst case. A really interesting question 

5h ' ^^ \iOw adiabatic algorithm performs on random instances of combinatorial optimization problems — the typical-case 

Ci ■ complexity. Historically, the benchmark problem for the quantum adiabatic algorithm has been the exact cover 

problem. An instance of random exact cover problem is a set of N bits and M clauses, each clause C containing 

three bits {xi^ , Xj^ , x^^ ) chosen uniformly at random. One seeks an assignment such that bits in each clause add 

up to 1: Xi^ -\- Xj^ + Xkc = 1. A cost (xi^ + Xj^ -\- Xkc ~ 1)^ ^ is assigned to each clause so that the total cost 

E(x) (given by the sum over individual clauses) is zero for satisfying assignments. In terms of Pauli operators [where 

d'f\xi) = {—l)^'\xi), d-f\xi) — \xi)\, the quantum Hamiltonian is written as (cf. Eq. (2) of Ref. [l|): 

H{X) =M-\Y. ^^^l + ^ E ^l^] + ^l^l + '^I'^fc) - ^ E '^?^' (2) 

i (ijk) i 

where Bi is the number of clauses in which bit i appears and the sum in the third term is over all clauses {ijk). 
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The first three terms describe the problem Hamiltonian that is diagonal in (T^-representation, while the fourth term 
describes a magnetic field in the transverse direction. 

It has been observed that the bottleneck of simulated annealing (which can be thought of as a classical counterpart of 
quantum adiabatic algorithm) is the vicinity of temperature-driven phase transition. One might conjecture that the 
bottleneck of quantum adiabatic algorithm is the vicinity of transverse-field-driven quantum phase transition at finite 
A = Ac > 0. This indeed had been confirmed in a few random NP-complete problems [3]. However, there is no reason 
to expect that this scenario is universal; even the existence of the phase transition cannot be guaranteed in some models 
[J| . Ref . [l| asserts that the bottleneck of the quantum adiabatic algorithm for random exact cover is unconnected to 
the quantum phase transition but is due to "accidental" avoided crossings of energy levels corresponding to localized 
states for infinitesimal transverse fields (A — >■ as A^ — > oo). The associated gap is related to the overlap between 
localized states and is expected to be exponentially small. It is claimed that this mechanism is not peculiar to the 
random exact cover problem, but applies to a wide range of NP-complete problems defined on random hypergraphs. 
The possibility of avoided crossings for A < Ac had been raised before, for a model with a quasi-continuous (level 
spacings <^ 1) spectrum [5,], but was thought not to occur for models with a discrete spectrum, such as exact cover, 
K-SAT, etc. Ref. [ij predicts avoided crossings close to the end of the algorithm whereas recent quantum Monte Carlo 
(QMC) simulations show the bottleneck in the middle of the algorithm [6]. However, QMC studies consider a different 
ensemble (extremely rare instances with a unique satisfying assignment are chosen) and the problem sizes considered 
{N = 256) may be too small (Ref. [l| estimates that the described mechanism may not kick in until N ~ 10^). We 
will demonstrate that the exponential degeneracy of the ground state, which is a distinguishing feature of random 
NP-complete problems with discrete spectrum addressed in [l|, dooms the proposed mechanism. Note that when the 
instance is not drawn from a uniformly random ensemble but is instead crafted to contain exactly one global and one 
local minimum separated by N bit fiips, the avoided crossing does take place for A — > ^]. 

Overview of perturbation theory analysis. Ref. [l| starts with the classical Hamiltonian for an instance with N 
bits and M clauses and develops a perturbation theory in a small parameter A <C 1. In this limit the perturbation 
theory is expected to be locally convergent. One may consider a global minimum E{xo) = and a local minimum at 
E{xi) — 1. For small A > 0, these energy levels acquire perturbative corrections 

eH^KX) « <5i?(f )(A), Ei^'HX) « 1 + 6Ei]'\X). (3) 

For some value A such that SExq (A) — SEx^ (A) ~ 1, the levels corresponding to states localized near x = Xq and 
X = Xi will be equal in energy. The minimum gap will be non-zero, but exponentially small, provided that Xq and 
xi differ by 0{N) bit flips. 

In a drastic simplification, Ref. [l| uses a clever trick to show that It suffices to examine only the global minima. 
Once a clause contradicting Xi is removed, both Xq and Xi will have zero cost. Writing Ex (A) for the energy of 
eigenstate localized near x for the instance with M — 1 clauses, and A* denoting the solution to 

i?ir'HA*)-4f-^)(A.) = l, (4) 

it can be argued that an instance with AI clauses should have an avoided crossing for some A < A* . This follows from 
inequality satisfied for small A, 

< E(/^\X) - Ei^'-^HX) < 1, (5) 

obtained by treating the Af-th clause as a perturbation. Eqs. (|4]) and ([5]) together imply Ex^ (A*) > E'xi (A*), but 
the opposite inequality holds for A = 0. Therefore, the energies, being continuous functions of A. must be equal for 
some A < A*. This construction is visualized in Fig. [T] (left). 

In general, one considers a random instance with M — 1 clauses chosen uniformly at random and some solution xi. 
A new uniformly random instance with M clauses is formed by adding a new random clause. With finite probability, 
the new clause is violated by Xi so that levels corresponding to Xi and some other solution Xq satisfied by the new 
clause cross for A such that A_Eio = Ex^ (A) — Ex^ (A) ^ 1. Failing that, random instances with M + 1, M + 2, 
etc. clauses may be generated by adding more random clauses, which ensures that avoided crossing takes place with 
probability tending to one. 

The necessary condition for this mechanism is the convergence of the perturbation theory. Ref. [1] justifies its use 
by showing that A* — )> as iV — >■ oo. Within ordinary (non-degenerate) perturbation theory up to the 4th order, the 
energy of the state corresponding to a solution with zero cost is (cf. Eq. (33) in Ref. [1]) 

ExiX) ~ common term + A > -^ —Xi H -^, — —Xi -\ ^- — — —Xh . (6) 

^' f^\l~^/{B,+BkY ' l~A/{B, + BkY ' l-4/(B, + B,)2 V "- ' 
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FIG. 1: Left: Top figure shows the intersection of energy levels corresponding to a global [E{xo) ~ 0] and a local [E{xi) = 1] 
minimum. Bottom figure shows two levels corresponding to global minima for an instance with one less clause. When the 
level splitting equals one at A = A*, the levels of the original instance will cross for somewhat smaller A. Right: The effect of 
exponential degeneracy is depicted here. Level splitting (bottom figure) is large only for randomly chosen levels (blue lines) 
so they will intersect (top figure) with high probability. Level with the smallest perturbation correction {xq and xi) may not 
intersect until A is large as the level splitting does not scale with A^. 



The common term is the same for all configurations but the second term is configuration-dependent. For uniformly 
random ensemble, Bi [defined in the text surrounding Eq. ([2])] are random Poisson-distributed variables with mean 
3M/N = 0(1), each term in the sum over M = 0{N) clauses is a random 0(1) variable. By central limit theorem, 
the sum is approximately a Gaussian of width 0{^/N) so that for two different bit configurations AEiq ~ vWA"*. 
Therefore, Ref. [ij] claims that avoided crossings take place for A ~ 1/iV^/^ <^ 1, well within the region of applicability 
of perturbation theory. 

Effects of exponential degeneracy. The argument at the end of the previous section overlooks the fact that the 
values of Xq and Xi are correlated with realizations of random instances. While in many circumstances neglecting 
correlations may not lead to qualitative changes, an important factor in this case is the large number of solutions with 
zero cost. Even if perturbative corrections to all solutions are assumed independent random variables, it can only be 
established that randomly chosen energy levels corresponding to E — and E = 1 may intersect for A ~ 1/N^^^, as 
depicted in Fig. [T] (right). Since we are interested in the intersections with the ground state, we require that Xi and 
Xq correspond to the ground states of Hamiltonian with M — 1 and M clauses respectively, i.e. that Ex^ (A) and 

Exo (X) be smallest. But with this restriction, we will see that Exi,{X) — Ex^^iX) ^^ A'', so that avoided crossings are 
unlikely until A ^ 1 , which may be outside the radius of convergence of perturbation theory. 

It is important to realize that the number of solutions of random exact cover is exponential in N, even near the 
satisfiability threshold as = M/N sa 0.626 where the random instance is satisfiable with probability 1/2. Just prior 
to adding a random clause which makes an instance unsatisfiable, some bits are frozen (have the same values in all 
solutions) while others are not. The latter, "soft" bits, contribute to the exponential degeneracy. In the numerical 
simulations of Ref. [l| all bits that do not appear in any clause as well as clauses with two or more bits that do not 
belong to any other clause are removed. This ensures that flipping two bits does not lead to another solution E — 
as that would make the expression ^ formally infinite, indicating that a degenerate perturbation theory should be 
used instead. Such hypergraph trimming does not affect the satisfiability of the instance, can be done in polynomial 
time prior to running quantum adiabatic algorithm, and removes "trivial degeneracies". However, it does not remove 
all degeneracies: there will remain soft bits; moreover whether a given a given bit is soft depends on the assignment 



of "hard", or frozen bits. In Fig. [3] (left) we plot the number of solutions of trimmed hypergraph as a function of N 
at satisfiability threshold. It is seen that the number of solutions grows exponentially as A/" ~ exp(0.021Af). The 
smallness of exponent is the reason this effect only starts to manifest itself for N > 100. 
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FIG. 2: Left: Number of solutions (for satisfiable instances) as a function of A'^. The data fit exponential dependence M ~ 
exp(cA'') with c ~ 0.0209 ±0.0002. Right: The difference between the largest and the smallest 4th order perturbative correction 
to the ground state, as a function of A'^. A linear fit with the coefficient 0.0535 ±0.0004 is obtained. A leaf removal algorithm has 
been applied to randomly generated instances to insure that no clause contains more than one bit not appearing in other clauses. 
Errorbars correspond to one standard deviation (68% confidence interval). Linear fits are for the interval 100 ^ A'^ ^ 1000. 

Once the presence of exponentially many solutions [JV ^ exp(cA^)] is taken into account, the density of states Ex{X) 
for all solutions is written as 



p{E)=M 



1 



V2TTNa 



exp 



2a^N 



I 



exp 



cN - 



{E-Ef 



2a^N 



(7) 



where the energy levels are assumed to have Gaussian distribution with mean E and variance ayN, and where 
a — ©(A**). Then the energy of the ground state £'o('^) (corresponding to configuration with the smallest perturbation 
theory correction) can be estimated by solving p{Eq) « 1. This implies 



En 



E - NaV2c + O 



logN 

N 



(8) 



Notice that this correction is proportional to N rather than \/ N . The fluctuations of Eq are 0{a) and have a Gumbel 
distribution Q. This linear scaling is verified numerically in Fig. [5] (right) where we plot the difference of the largest 
perturbative correction and the snrallest perturbative correction (so that the common term cancels out) as a function 
of A^. 

Next, we show that the level spacing is only 0{a) = O(A^) and does not scale with A^. Let us compute the probability 
that the gap to the first excited state is at least A. First, pick a reference energy E'lof and write down the probability 
that exp(cA^) — 1 levels have higher energy and 1 level has energy £'o — -Erof — A: 



exp(cA^) 



-(E~Ef/{2a^N) dE' 



a\/2nN 



cxp(cAf)-l 



1 



a\/2TTN 



e-(^'--^-^)'/(2-'^)d£;rrf. 



This expression is non- negligible only if E^d — E ^ —Ng\/2c. The desired expression is an integral over £^ref , 



v(Ex -Eo> A)= exp 



^ ^■^°'-A-:^)A(i?,ef)di?,.ef, 



a^N 



2a^N 



(9) 



where A(i?rof) is a conrplicated expression independent of A. Replacing ii'rcf with its approximate value in the 
exponential and neglecting the term quadratic in A, we obtain 



piEi - Eo ^ A) = exp - 



2c 



(10) 



where we also used the fact that the probabihty is 1 when A = 0. The same result is obtained for E2 — Ei, E3 — E2, 
etc. At the edge of the spectrum, the spacings are exponentially distributed with mean aj^/2c, i.e. levels have Poisson 
statistics. These results are not new: they are well-known in extreme value statistics Q and appear in a solution of 
Derrida's random energy model. 

When a new random clause is added, a fraction of the solutions will disappear. If we neglect any correlations as 
before, we can assume that each solution will satisfy the new clause with finite probability p < 1. Conditioned on the 
fact that the "old" ground state contradicts the new clause, the probability that old fc-th excited state satisfies it, but 
1st, 2nd, {k ~ l)-st excited states contradict it. 



fc-i 



Pk ^p{l-p) 
The gap between old ground and fc-th excited state Ek — Eq is distributed with probability density 



(11) 



Pk{x) 



2c 



„k-l 



. vli. 



(fc-1)! 



(12) 



Therefore, the distribution of spacing between the old ground and lowest-lying excited state satisfying the new clause 



Pi^) ^^PkPk{x) = 



2c 



PV^., 



(13) 



fc=i 



an exponential distribution with mean 



- fi.(M-l), 



)V2c' 



Strictly speaking, this is not the same as the distribution of the correct 



p(M-l) 



quantity AEio{X) = Exg ~ (A) — E^^ ~ (A), where Xi and Xq correspond to the ground state of instance with M — 1 
and M clauses respectively. The addition of new clause introduces a configuration-dependent correction O(A^) which 
is comparable to O(A^) level spacing. This means that the levels are somewhat "reshuffled", i.e. old (fc -I- l)-st excited 
state may become smaller in energy than old k-th excited state. We therefore expect that the distribution of energy 
differences will deviate from true exponential, but the characteristic scale should still be O(A^) with no TV-dependence. 
Fig. [3] illustrates the distribution of AE'io for a particular random instance; for large N the distribution still has an 
exponential tail, but the middle of the distribution slightly deviates from true exponential. 




FIG. 3: The values of normalized level splittings A_Eio/A* for 4000 random instances with A'^ = 200 and A'" = 1000, sorted in 
a decreasing order. Each dot's y-coordinate is the value of the spHtting and the a;-coordinate is its index k in the decreasing 
sequence. A straight line on semilogarithmic plot would correspond to exponential distribution. Deviation from true exponential 
is noticeable for N = 1000. A clause-to-variable ratio is fixed to M/N — 0.62. 



We should mention that approximating the distribution of configuration-dependent 4th order corrections can be 
approximated by a Gaussian only ior E — E ^ a/ZVA^, but Eq corresponds to the tail of the distribution where this 



approximation is not valid. The probability density of the sum of 0{N) random variables, each having variance 
O(A^) is expected to be exponentially small when we are 0(iVA'*) away from the mean; the exact dependence can be 
computed by considering optimal fluctuations. Hence, we still expect that E — Eq ^ NX'^. Similarly, level spacing 
El — Eq ~ A*, although there is no guarantee that it is exponential-distributed. Therefore, our conclusions are 
independent of this approximation. 

Since AE'io ~ A"*, avoided crossings should not take place until A ^ 1. But for these values of A, higher orders of 
perturbation theory may not be discarded and the perturbation theory itself may become divergent, as it should near 
the quantum phase transition. 

Our prediction is in apparent disagreement with the results of numerical simulations of Ref. [1| that seem to support 
the claim that AEiq ^ ^/NX^. Ref. [Jl correctly examined the edge of the spectrum: all solutions were enumerated 
and the 4th order perturbation theory corrections both before and after adding the new clause were computed for Xi 
and Xq that would correspond to the local and global minima, i.e. having the smallest perturbation theory correction. 
The average, median and percentiles of p [(A£'io/A^)^] as a function of N for up to A^ = 200 were plotted and a 
linear fit was found. However, as we mentioned earlier, for N ^ 100 the effects of exponential degeneracy are not 
yet prominent. Had the simulation been extended to larger values of N, the flattening of the curves would have been 
observed suggesting a finite limit as A^ — > oo. 

Numerical results. We have extended the numerical study of Ref. [l| to much larger values of A''. A complete 
enumeration of all solutions becomes prohibitively time-consuming as the number of solutions explodes. However, we 
are really interested in a solution with the smallest 4th order perturbation theory correction. From Eq. ([6]) it is seen 
that this correction is linear in binary variables. Finding a solution corresponding to the ground state is equivalent to 
solving integer linear programming (ILP) problem, for which we utilize standard software packages [9]. ILP algorithms 
are more efficient than approaches based on a complete enumeration as entire branches corresponding to suboptimal 
solutions are pruned using e.g. LP relaxations as a lower bound. 
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FIG. 4: Left: Average as well as 75"\ 50"^ (median), and 25"^ percentiles of distribution of (AE'io/A"')^ for different values of 
A*' (cf. Fig. 2 from Ref. [J|). Dashed lines are linear fits for 50 ^ N ^ 200. Right top: Just the median of the distribution as a 
function of A^. Right bottom: Average and percentiles of p(A_Bio/A^) (not squared!) on a log-log plot. Dashed lines correspond 
to power-law fits in the interval 100 ^ A'' ^ 1000. The exponents obtained were: 0.33 ± 0.01 (average), 0.23 ± 0.01 (75%), 
and 0.13 ± 0.01 (median and 25%); here error estimates refer only to the goodness of fit. The estimates of the exponents are 
unreliable since only one decade of value of A' was included in the fit. Errorbars correspond to one standard deviation (68%). 
A clause-to variable ratio is M/N = 0.62. The results are about 20% larger than in Ref. [l|. The discrepancy might be due 
to minor difference in the numerical procedure: we chose the added clause at random among those that contradict xi, while 
Ref. (l|| restarted from scratch if random clause did not contradict xi. The difference is not essential since the the probability 
that a random clause contradicts a particular assignment is finite, but since this probability depends on the number of ones 
and zeros in xi, the distributions are not identical. 



In Fig. [5] one can see that the curves are levehng off for larger values of N, in agreement with our argument that 
AEiQ should not scale with N. The fact that the average square of the gap and the 75th percentile are so close to 
each other for A^ ^ 200 (also seen in Ref. [i|) is not coincidental. It is an indirect evidence that the distribution of 
AE is close to exponential since 1 — e~^^ w 0.757. For larger N, the distribution is not exponential, possibly due to 
above-mentioned reshuffling of energy levels as their density is increased. 

The flattening of the curve corresponding to the median is quite pronounced. Since average is more sensitive to the 
tails of the probability distribution, even larger values N may be needed to show its approach to the limiting value 

at iV — ^ oo. 

Of course, the present numerical study cannot completely rule out the possibility that A_Eio still increases with N with 
a power-law exponent smaller than 1/2. Indeed, the median (which is more statistically robust measure of scale than 
the average) seems to grow as A''"-'^^ in the interval 100 ^ A^ ^ 1000. Tails of the distribution might be responsible 
for larger exponents observed for the 75**^ percentile and the average. If the corrections were to grow indefinitely, for 
sufficiently large N they would be large enough to cause avoided crossings. With the assumption that corrections 
increase as 7V^/^, Ref. lv\ claims that the mechanism may only set in for very large N > Nc, where the threshold had 
been estimated as either Nc « 5400 or Nc ~ 86000 depending on assumptions made. If the corrections were to rise 
only as N'^-^^ rather than N'^-^, the value of Nc would be pushed even higher. We expect that an observed power- 
law fit with a finite value of the exponent is an artifact of using too short an interval (between 100 and 1000). An 
observation that the exponent is close to 1/ In 1000 (corresponding to the largest size considered) suggests a possibility 
that corrections increase as a logarithm of N. A logarithmic rise would violate the condition A* < 1/logiV given in 
Ref. fr|: indeed, a central point of its argument is the claim that corrections increase as a finite power of N, or much 
faster than a logarithm. The less stringent condition conjectured there would be satisfied, but the corresponding value 
of Nc might be astronomically large. 

Numerical results clearly contradict the square-root-of-7V scaling, but cannot reliably distinguish an approach to a 
finite limit from an extremely slow increase with N (e.g. as a logarithm). Based on numerical study alone, this 
scenario cannot be ruled out, but the theoretical analysis of the previous section, although imprecise, suggests that 
the corrections approach a finite limit as N ^- oo. But we can think of no reason that might cause a plausible 
logarithmic rise. 

Concluding remarks. Wc want to highlight one important limitation of the perturbation theory approach. Even 
for the "trimmed" ensemble considered in Ref. [l|, strictly speaking the largest configuration-dependent correction is 
not O(A^) but rather O(A^), the latter coming from degenerate perturbation theory. Indeed, consider two clauses 
connected to the remainder of the graph as depicted in Fig. [S] (left). If both a;i = 2:2 = then (a;3,a;4,X5) can 
be assigned either (0,1,0) or (1,0,1). Since the two configurations with the same energy differ by 3 bit flips, the 
splitting caused by the degenerate perturbation theory causes O(A^) correction to the energy. It can be argued that 
such clauses can be removed: since they can be satisfied for any value of xi and X2 they only contribute to trivial 
degeneracies. However, in a similar example involving three clauses [see Fig. [5] (right)], they cannot be removed and 
yet they contribute 0{X^) due to the degenerate perturbation theory correction — the same order as the correction 
due to ordinary perturbation theory. In other problems the effect of degenerate perturbation theory can be stronger: 
for K-SAT it enters as 0(A) correction. The difficulty of dealing with contributions from the degenerate perturbation 
theory is a need to diagonalize matrix involving many solutions. Although ordinary perturbation theory is inadequate, 
we believe that our main contention, that AE does not scale with N, is still correct. 

The crucial factor in our analysis is the existence of exponentially many solutions. This phenomenon is common to all 
combinatorial optimization problems defined on random hypergraphs. One might ask if in some models hypergraph 
"trimming" may lift this degeneracy. One such example is ilT-XOR-SAT problem, where exponential degeneracy can 
be removed right at the satisfiability threshold by such trimming. However, perturbative corrections are independent 
of bit assignments to all orders of perturbation theory, and the mechanism described in Ref. py is not applicable 
there. This is probably not coincidental: unless local energy landscapes are identical in the vicinity of all solutions, 
the exponential degeneracy may not be removed by only geometric transformations of the random hypergraph. 

While we refute the claim that exponentially small gaps appear with high probability for A — >■ 0, the general possibility 
of exponentially small gaps for finite A < Ac cannot be ruled out. But estimating the probability of their occurrence 
might require using non-perturbative approaches. 

We acknowledge the financial support of the United States National Security Agency's Laboratory for Physical 
Sciences. We also acknowledge the support with computational resources (32-node Linux cluster) from the United 



remainder of instance remainder of instance 

FIG. 5: Left: An example of 0{\ ) contribution from the degenerate perturbation theory. If xi — X2 = 0, two allowed 
assignments of variables (xa, 2:4, xs): (0, 1,0) and (1,0, 1) differ by three spin flips. Right: An example of 0(A*) contribution 
from the degenerate perturbation theory. (14, X5, xe, xr) can be either (0, 1, 0, 1) or (1, 0, 1, 0) if xi = X2 ~ xs — 0. The clauses 
cannot be removed without affecting the satisfiability of the instance: they prohibit an assignment xi — x-^ = 1, X2 = 0. In 
each figure solid dots represent binary variables and triangles represent clauses in an instance of exact cover problem. Binary 
variables below the dashed lines are involved in other clauses as indicated by zigzag lines. 
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